Recent advances in leveraging human guidance for sequential decision-making tasks

نویسندگان

چکیده

A longstanding goal of artificial intelligence is to create agents capable learning perform tasks that require sequential decision making. Importantly, while it the agent learns and acts, still up humans specify particular task be performed. Classical task-specification approaches typically involve providing stationary reward functions or explicit demonstrations desired tasks. However, there has recently been a great deal research energy invested in exploring alternative ways which may guide may, e.g., more suitable for certain less human effort. This survey provides high-level overview five recent machine frameworks primarily rely on guidance apart from pre-specified conventional, step-by-step action demonstrations. We review motivation, assumptions, implementation each framework, we discuss possible future directions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Group Decision-Making Models for Sequential Tasks

The sequential probability ratio test (SPRT) and related drift-diffusion model (DDM) are optimal for choosing between two hypotheses using the minimal (average) number of samples and relevant for modeling the decision-making process in human observers. This work extends these models to group decision making. Previous works have focused almost exclusively on group accuracy; here, we explicitly a...

متن کامل

Decision-Making in Research Tasks with Sequential Testing

BACKGROUND In a recent controversial essay, published by JPA Ioannidis in PLoS Medicine, it has been argued that in some research fields, most of the published findings are false. Based on theoretical reasoning it can be shown that small effect sizes, error-prone tests, low priors of the tested hypotheses and biases in the evaluation and publication of research findings increase the fraction of...

متن کامل

Challenges for Communication Decision-Making in Sequential Human-Robot Collaborative Tasks

Effective communication between teammates is critical to the success of collaboration, including human-robot collaboration. For enabling human robot communication, several modalities are actively being researched — such as, text, speech, visual signals, and legible motion. The design of these modalities is necessary to achieve effective communication; however, it is not sufficient. Communicatio...

متن کامل

Convergence in a sequential two stages decision making process

We analyze a sequential decision making process, in which at each stepthe decision is made in two stages. In the rst stage a partially optimalaction is chosen, which allows the decision maker to learn how to improveit under the new environment. We show how inertia (cost of changing)may lead the process to converge to a routine where no further changesare made. We illustrate our scheme with some...

متن کامل

Structure Learning in Human Sequential Decision-Making

Studies of sequential decision-making in humans frequently find suboptimal performance relative to an ideal actor that has perfect knowledge of the model of how rewards and events are generated in the environment. Rather than being suboptimal, we argue that the learning problem humans face is more complex, in that it also involves learning the structure of reward generation in the environment. ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Autonomous Agents and Multi-Agent Systems

سال: 2021

ISSN: ['1387-2532', '1573-7454']

DOI: https://doi.org/10.1007/s10458-021-09514-w